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Abstract. We study fixed parameter algorithms for three problems: 
Kemeny rank aggregation, feedback arc set tournament, and betweenness 
tournament. For Kemeny rank aggregation we give an algorithm with 
runtime o*(2° WOPT) ), where n is the number of candidates, OPT < (™) 
is the cost of the optimal ranking, and 0*(-) hides polynomial factors. 
This is a dramatic improvement on the previously best known runtime of 
O* (2°' OPT '). For feedback arc set tournament we give an algorithm with 
runtime O* (2°' %/OPT )), an improvement on the previously best known 
0*(OPT 0{VOPT) ) lAlon et all [2009( ] . For betweenness tournament we 

give an algorithm with runtime 0*(2 ^v / ° PT//ri ^), where n is the number 
of vertices and OPT < (™) is the optimal cost. This improves on the 

previously known O* (OPT° (opt1/3) ISaurabhl [20091 ]). especially when 
OPT is small. Unusually we can solve instances with OPT as large as 
n(logn) 2 in polynomial time! 

Keywords: Kemeny rank aggregation, Feedback arc set tournament, 
Fixed parameter tractability, Betweenness tournament 

1 Introduction 

Suppose you ran a chess tournament, everybody played everybody (a.k.a. round 
robin) and you wanted to use the results to rank everybody. Unless you were 
really lucky, the results would not be acyclic, so you could not just sort the 
players by who beat whom. A natural objective is to find a ranking that min- 
imizes the number of upsets, where an upset is a pair of players where the 
player ranked lower in the ranking beat the player ranked higher. Minimizing 
the number of upsets is called feedback arc set problem on tournaments (FAST). 

* A preliminary version of th is work appeared in version 1 of the arXiv preprint 

iKarpinski and Schudvl [2009( 1 . 
** Parts of this work done while visiting Microsoft Research. 
* * * Parts of this work done while visiting University of Bonn. 



The complementary problem of maximizing the number of pairs that are not 
upsets is called the ma ximum acyc l ic subgraph problem on tournaments. T hese 
probl ems are NP-har d [Ailon et all 120081 lAlonl . 120061 ICharbit et ail . 120071 ] (see 



also Conitzer , 2006 



but a polynomial-time approximation scheme (PTAS) 



Mathieu and Schudvl [2007| is known. 

In statistics and psychology, one motivation is ranking by paired compar- 
here, you wish to sort some set by some objective but 



Slater. 1961 



isons 

you do not have access to the objective, only a way to compare a pair and 
see which is greater; for example, determining people's preferences for t ypes of 
food. This problem attracted computational attention as early as 1961 Slaterl . 
196l| (for comparison Hoare published quicksort the same year). Feedback arc 
set tour nament and closely related problems have also been used in machine 
learning [Cohen et all . Il999l lAilon et all l2008j . 

The FAST problem can be generalized to a problem we call weighted FAST, 
sometimes known as feedback arc set with probability constraints. The input is 
a complete directed graph with arc weights {w uv } u _ v with w uv + w vu — 1 for 
every pair of vertices u, v. In other words a weighted FAST instance is a convex 
combination of unweighted FAST instances. 

We study the parameterized complexity of this problem, in particular the 
parameter OPT, the cost of an optimal ranking. We use the notation 0*(-) to 
hide factors that are polynomial in the input size, that is /(I) G 0*(g(I)) iff 
/CO < 9(I)\I\° f° r some c > and all sufficiently large inputs /. The first fixed- 
parameter algorithms for feedback arc set tournament had runtime O* (2°( OPT ^) 
and later algorithms made a dramatic improvement to O* {OPT°^ OPT ^ ) lAlon et al 
|2009| . We improve this to 0*(2°^ /UpT )). 

Theorem 1. There exists a deterministic parameterized subexponential algo- 

O(l) 



rithm for weighted FAST with runtime i°Wopt) _j_ n 
algorithm uses OPT°^ OPT ^ + time and space 



A variant of the 



The Exponential time hypothesis (ETH) llmpagliazzo and Paturil 2001 



is 



that 3-SAT cannot be solved in time 2°( numbcr of ™ ablcs ). We also give a match- 
ing lower bound assuming the ETH: 

Theorem 2. There does not exist a parameterized algorithm for weighted FAST 

with r untime O* {2°^ OPT ^) unless the exponential time hvvothesis \lmvaaliazzo and Patur, 

\200A l is false. 



We leave open the possibility that unweighted FAST may admit an exact 
algorithm with runtime O* (2 o( - VOPT ^ 



We note that independently of this work Uri Feige iFeigd [2009j ] gave an un- 
related algorithm for unweighted FAST matching Theorem [T] 

An important application of weighted feedback arc set tournament is rank 
aggregation. Frequently, one has acc ess to several ranki ngs of objects of some 
sort, such as search engine outputs [Dwork et all l200l| . and desires to aggre- 
gate the input rankings into a single output ranking that is similar to all of 



the input rankings: it should have minimum average distance from the input 
rankings, for some notion of dista nce. This anc ient p roblem was alread y stud- 

' 17851 



Bordal . Il78l| and ICondorcet 



the 



icd in the context of voting by 

18 th century, and ha s aroused renewed interest recently [Dwork et all 12001 



Conitzer et al. , 20061 ] . A natural notion of distance is the number of pairs of 



vertices that are in different orders, which is known as the Kendall - Tau dis- 
tance . This defines the Keme ny rank aggregation problem (KRA) [Kemenv 
19591 Kemenv and Snell . 1962| . This choi ce yields a m aximum likelihood es- 



timat or for a certain naive Bayes model Yound . 1995 1. This problem is NP - 



hard [Bartholdi et al 1 Il989t even wi t h onl y four voters [Dwork et all [2001], 



and has a PTAS iMathieu and Schudvl [2007] 



We denote the average distance between the optimal ranking and the in- 
put rankings by OPT < (™) . Two parameters have attacted the bulk of the 
study: OPT and the average Kendall- Tau distance between the input rank- 
ings. It is easy to see (triangle inequality) that these two parameters are within 
a constant factor of each other, so these parameters give equivalent runtimes 
up to constants i n the exponen t . All previous work give algorithms with run- 
time 0*f2°( 0Pr )') lBetzler et all |2009j . There is a standard reduction from Ke 



meny r ank aggregation to weighte d FAST lAilon et al.l [2008j . lCoppersmith et al 



2006f . lMathieu and Schudvl [2007} . so we improve the best known parameterized 



algorithm for KRA dramatically to O* (2°^ OPT ^) as a corollary of our Theorem 

HI 

Corollary 3. Let n be the number of candidates and OPT < (™) the optimum 
value. There exists a deterministic parameterized subexponential algorithm for 
Kemeny Rank Aggregation with runtime and space 2°( v ' OPT ) -f tjCK 1 ). A variant 
uses 0PT o( - VOPT ^ + 7i°W time and n°W space. 

Some other paramters have attracted attention. The parameter of maximum 
Kendall- Tau distance has been studied but yield bounds no tighter (up to con- 
stants in the expon ent) than is known for the average Kendall- Tau distance 



Betzler et all 20091 ]. Another parameter is the maximum r max , over candidates 



c and pairs of voters vi,V2, of the absolute differe nce between the rank of c in 
«i and v 2 - The best runtime known is O* f2 ^ max ^) iBetzler et"all [2009j . 



In the Betweenness problem we are given a ground set of vertices and a set 
of betweenness constraints involving 3 vertices and a designated vertex among 
them. The objective function of a ranking of the elements is the number of 
betweenness constraints for which the designated vertex is not between the other 
two vertices. The goal is to minimize the o bjective function. For the status 
of the general Betweenness p roblem, see e.g . Qpatrnv 1979j . Chor and Sudan 



1998| , lAilon and Alonl [20071 ] , ICharikar et~al1 [20091 ]. We refer to the Betweenness 



problem in tournaments, that is in instances with a c onstraint for eve r y trip le 
of vertices, as the Betw eennessTour 
This problem is NP-hard lAilon and Alo 



iroblem (see lAilon and Alonl [20071 ] ). 
20071 and has a recently discovered 



polynomial-time approximation scheme Karpinski and Schudv 2009j . We study 
its parameterized complexity. 



Theorem 4. There exists a randomized parameterized subexponential algorithm 
for BetweennessTour with runtime and space 2 0< V 0PT /™) • n ^, where n 
is the number of vertices and OPT is the cost of the optimal ranking. It succeeds 
with constant probability. 



The previously best known runtime was O* (2°( OPTl/3 lo s ° pr )) ISaurabbl [2009( | 



Our result is better by a logarithmic factor in the exponent for the largest pos- 
sible OPT = 0(n 3 ) and even better for smaller OPT. Interestingly we can solve 
all instances with OPT — 0(n log 2 n) in polynomial time! 

Our results easily generalize to a ll fully dense ranking CSPs of arity three with 
fragile constraints as introduced by iKarpinski and Schudv 2009| | . For simplicity 



we limit ourselves to the well-known problems discussed above. 

We now outline the organization of our paper. Section [2] discusses weighted 
feedback arc set tournament, including our algorithm (Section I2.1[) . analysis 
(|2.2|) . and lower bound (|2.3j) . Section [3] discusses our results for betweenness 
tournament, including our algorithm (Section l3.1|) and analysis (|3.2[) . 



2 Feedback arc set tournament 



2.1 Algorithm 



We now outline some of our key techniques. Firstly any two low-cost rankings 
for a FAST problem are nearby in Kendall- Tau distance. Secondly two rankings 
that are Kendall- Tau distance D apart are equivalent to within additive 0(\Hj) 
in how good each position for each a vertex is (Lemma [6]). Thirdly most vertices 
(in a low-cost instance) have a vee-shaped cost versus position curve and optimal 
rankings are locally optimal so we know that each vertex belongs at the bottom of 
its curve. The uncertainty in this curve by \f~D causes an uncertainty in the opti- 
mal position also around \f~D (Lcmmas[7]and|8|). Our algorithm simply computes 
uncertainties r(v) in the positions of all of the vertices v and solves a dynamic 
program for the optimal ranking that is near a particular constant-factor approx- 
imate ranking. We rema rk that Brave r man and M ossel iBraverman and Mossel 



2008] and Betzler et al. iBetzler et al.l [20081 12009| previously applied dynamic 



programming to FAST and KRA. 

First we state some core notation. Throughout this paper let V refer to the 
set of objects (vertices) being ranked and n denote \ V\. Our O(-) hides absolute 
constants only. Our 0*(-) hides a polynomial in n. A ranking is a bijective 
mapping from a set S C V to {1, 2, 3, . . . , \S\}. We call ir(v) the position of 
v in the ranking tt. We let <i(7r,7r') denote the Kendall- Tau distance between 
rankings tt and tt' , i.e. the number of pairs of vertices in different orders in the 
two rankings. An ordering is an injection from S into K. We use tt and a (with 
superscripts) to denote rankings and orderings respectively. 

The input to weighted FAST is a set V of vertices and arc weights {w uv } UyV& v 
such that w uv + w vu = 1 for all u 7 v e V. The FAST objective function is 
the weight of the backwards arcs C(n) = Y, u ,veV:ir(v)>w(u) Wvu - For ranking tt, 



Algorithm 1 Exact algorithm for FAST. If dynamic programming is used in the 
last line the runtime and space are both n°^2°^ OPT K If divide-and-conquer 
is used the runtime is n °WOPT) an( j space is . 
Input: Vertex set V, arc weights {«) m }«,»ey. 

1: Sort by weighted indegree ICoppersmith et all [20061 ] . yielding ranking tt 1 of V. 

2: Set r(v) = 4 v /2C(tt 1 ) + 26(tt 1 , v, tt 1 (v)) for all veV. 

3: Use dynamic programming or divide-and-conquer (Details: Lemma [9)l to find the 
optimal ranking tt 2 with |vr 2 (t;) — tt 1 (^;) | < r(v) for all v. 



vertex v € V and p £ M (with tt(u) ^ p for all 11 ^ u) wc define b(n,v,p) = 

V\ ,„ | Wvu H^* ^ ""W j_ e _ cog j. £ arcs inpjfjeni; t w j n the ordering 
^ u ^ v \ w uv \ip< n(u) ' 6 

formed by moving v to position p in tt. Let 7r* denote an optimal ranking and 

OPT = C{tt*) its cost. 

Before running our main Algorithm [TJ we compute a small kernel, that is 
a smaller instance with the same optimal cost as the input instance (up to a 
known shift). This preliminary step allows us to separate the dependence on n 
and OPT in th e runtime, yielding the runtime stated in Theorem [TJ 
Dom et al 



Pom et al.l 



yielding 
. 2006] 



give an algorithm for computing kernels of 
unweighted FAST instances with OjOPT 2 ) vertices. This was later improved 



to 0(OPT) vertices by Bessy et al. Bessy et al.l 2009 1. There is a kernelization 
algorithm for Kemeny rank aggregation in iBetzler et al. |2009j , but it produces 
an instance of size 0((Number of voters) • OPT), not the desired OPT° {1 \ To 
get the desired ker nel for general weig hted FAST we consider a slight variant of 
the algorithm from iDom et ahl [2006] . 

Lemma 5. There is polynomial-time computable O (OPT 2 ) -vertex kernel for 
weighted FAST. 

Proof (sketch). Let OPT < U < 50PT be the cost of a 5-approximate ranking 



Coppersmith et al. .2006] 



We say that an arc is a majority arc if it has greater weight than its reverse, 
with ties broken arbit rarily. A majority arc clearly has weight at least 1/2. The 



majority tournament lAilon et ahl [2008] is the unweighted directed graph with 
vertex set V and arc set equal to the majority arcs. 

Our kernelization algorithm is simple: we apply the following two reduction 
rules, which are extensions of two reduction rules in lDom et al. 2006] . as often 
as possible. 

The first reduction rule is eliminating a vertex that is part of no cycles of 
three arcs in the majority tournament. Consider some such vertex v. It is easy 
to see that there exists an optimal ranking that puts every predecessor of v (in 
the majority tournament) before v and every successor of v after v, while implies 
the validity of this rule. 

The second reduction rule concerns an arc (it, v) of the majority tournament 
that is in more than 2U cycles of three arcs in the majority graph. Any feedback 



arc set not not paying for such an arc must pay for more than 20PT other 
arcs of the majority tournament, each of cost at least 1/2, and hence cannot be 
optimal. Therefore we record that we must pay w uv , then set weight w uv to zero 
and w VM to one. 

Now we argue that the resulting instance after these two rules are exhaus- 
tively applied has 0(OPT 2 ) vertices. An optimal feedback arc set, which nec- 
essarily has cost OPT, can include at most 20PT majority arcs. Each such 
majority arc is in at most 10OPT triangles by the second rule, so there are at 
most 2QOPT 2 triangles. Finally by the first rule every vertex is in a triangle, so 
there are at most 60OPT 2 vertices. 



We have no t investigated whet her or not the 0(OPT) vertex kernel for un- 
weighted FAST iBessv et all |2Q09{ can be extended to weighted FAST. 



2.2 Analysis 



Variants of the following Le mma are given in iMathieu and Schudv 2007 1 and 



Karpinski and Schudvl [2009|. We give a simplified proof here for completeness. 



Lemma 6 (jMathieu and Schudv [2007], Karpi nski and Schudv [2009]). 



Letir andir' be rankings overV. Itfollows that\b(n,v,p)—b(jr',v,p)\ < 2\J d(n, tt') 
for ally eV and p e K \ Z. 

Proof. Fix v £ V , p E R\Z and rankings 7r,7r'. Consider the sets of vertices L = 
{ u E V \ {v} : tt(u) < p < tt'(u) } and R = { u G V \ {v} : tt'(u) < p < tt(u) }. 
Intuitively these are the vertices that cross p from left to right (resp. right to 
left) when going from tt to tt' . It follows easily from the definition of b that 
\b(?T,v,p) — b(n',v,p)\ < \L\ + \R\, so we now proceed to bound \L\ and \R\. 

The bijective nature of tt and tt' implies that \L\ = \R\. Observe that all 
vertices in L are before all vertices in R in tt, and vice versa for tt' , hence 
d(TT,TT r ) > |L||iZ|. Putting these facts together proves the Lemma. 

Lemma 7. In Algorithm]]} we have \tt* (v) — 7r 1 (u)| < r(v) for all v € V and 
any optimal ranking tt* of V . 

Proof. The weight of an arc and its reverse sum to one so ^(Tr*,^ 1 ) < C(tt*) + 
C^tt 1 ) < 2C(tt 1 ). By Lemma © therefore 



\b(TT*, v, j + 1/2) -b(n 1 ,v,j + 1/2)| < 2 v / 2C(tt 1 ) (1) 



for any j £ Z. 



Fix v € V. We conclude 



|7T*(w) — 7T 1 (u) | 

< fr^ 1 , v, tt* (v)) + b^ 1 , v, 7T (u)) (w uv + w vu = 1 for all u) 
= 6(7r 1 ,-y,7r*(w) + 1/2) + b^ 1 , v , n 1 (v) + 1/2) (tt 1 is integral) 

< b(ir*, v, tt» + 1/2) + 2 v /2C( 7 r 1 ) + b(ir\ v, tt^v) + 1/2) (By ©) 



< 6(tt* , v, n 1 (v) + 1/2) + 2 V2CV) + ^tt 1 , u, tt 1 (u) + 1/2) (Optimality of tt* ) 

< 4V2C(tt 1 ) + 26(7r 1 ,«,7r 1 (u) + 1/2) (By©) 

= r(f) (Definition of r(v)). 

Lemma 8. In Algorithm^ we have max.,- eZ \{v G V : |7r 1 (w) — j\ < r(v) }\ — 
O(VOPT). 

Proof. Fix j G Z. Let R = { v G V : 1 7T 1 (i;) — j | < r(u) }, the cardinality of which 
we are trying to bound. We say v € V is pricey if 26(7r 1 , u, 7r 1 (w)) > y / 2C(7r 1 ). 
Clearly 2C(7r 1 ) = J2 V K 71 " 1 ! u i 7rl ( u )) ^ (number pricey) ^y / 2C(7r 1 ) hence the 
number of pricey vertices is at most 2C \ 7r ^ = 2\/2C(tt 1 ). All non-pricey 

* J (1/2)^/20(^1) V \ ■> 

vertices in i? have Itt 1 ^) - j| < r(u) < 5 v /2C(?r 1 ), so at most 10^/2C(tt 1 ) + 1 
non-pricey vertices are in i?. We conclude \R\ < \2J2 C{tt 1 ) + 1 = 0{y/OPT) 



since tt 1 is a 5- approximation Coppersmith et al. |2006l |. 



Lemma 9. There is a dynamic program for FAST that finds the optimal rank- 
ing tt 2 with |7r 2 (t>) — tt 1 (w)| < r(v) for all v using space and runtime 0(\V\ 2 )2^ , 
where ip = max-,- \ {v € V : |7r 1 (w) — j\ < r(v) }|. A divide and conquer variant 
uses space. 

Proof. Say that a set S" C V is i;a/id if it contains all vertices v with tt 1 {v) < 
|5| — r(w) and no vertex v with tt (v) > \S\ + r(v). Observe that for any s £ N 
all valid sets of size s agree except for the presence or absence of ijj vertices. 
Therefore there are at most n2^ valid sets. 

We say that a ranking tt of valid set S is valid if { v : ir(v) < j } is a valid 
set for all < j < \S\. It is easy to see that a ranking tt is valid if and only if 
satisfies \ir(v) — 7r 1 (u)| < r(v) for all v. 

One can easily see the following optimal substructure property: prefixes of 
an optimal valid ranking are optimal valid rankings themselves. 

For any valid set S let C(S) denote the cost of the optimal valid ranking of 
5*. The recurrence relation is 



C(S) = min 

v£S:S\{v} is valid 



C(s\{v})+ w ™ 

ues\{v} 



The space-efficient variant ev aluates C using di vide and conquer instead of 



dynamic programming, similar to iDom et al.l 2006]. Details deferred 



Now we put the pieces together and prove Theorem [TJ 

Proof (of Theorem^. The kernelization algorithm of Lemma [5] allows us to 
assume without loss of generality that n — 0(OPT 2 ). Algorithm [T] returns an 
optimal ranking by Lemmas [7] and [9l Lemmas [8] and |9] allow us to bound the 
runtime and space requirements of the dynamic program. 



2.3 Lower bound 

Proof (of Theorem^). For sake of contradiction suppose we have an algorithm 
for weighted FAST with runtime 2°^ OPT ^ . We present a series of reductions 
which converts such an algorithm into a subexpontial-time alg orithm for vertex 



cover, the existence of which is known to contradict the ETH Flum and Grohe 



200C 



Let an instance of vertex cover with n ve rtices be giv en. Applying Karp's 
reduction from vertex cover to feedback arc set lKard [l972j produces a feedback 



arc set instance with 2n vertices. Finally one can reduce this to a weighted FAST 
instance with the same number of vertices by representing incomparable pairs of 
vertices by opposite arcs of weight 1/2. The result is an weighted FAST instance 
with 2n vertices that is equivalent to the original vertex cover instance. The 
optimal cost for this instance is at most its number of arcs, which is 0(n 2 ), 
so the hypothesized algorithm has runtime 2°^ OPT ^ = 2°( n \ This runtime is 
subexponential, contradicting the ETH. 



3 Betweenness tournament 
3.1 Algorithm 

We now introduce some new notation for the betweenness problem. We let (^) 
(for example) denote the standard binomial coefficient and (Yj denote the set of 
subsets of set V of size k. For any ordering a let Ranking(cr) denote the ranking 
naturally associated with a. 

Let v i — ^ p denote the ordering over {v} which maps v to p. For set Q of 
vertices and ordering a with domain including Q let Q t-¥ a denote the ordering 
over Q which maps u G Q to o~(u), i.e. the restriction of a to Q. For orderings 
o~ and a 2 with disjoint domains let a 1 \a 2 denote the natural combined ordering 
over Domain{a ) U Domain(o~ 2 ). For example of our notations, Q i— > a\v i— > p 
denotes the ordering over Q U {v} that maps v to p and u G Q to <j(u). 

A ranking 3-CSP consists of a ground set V of vertices and a constraint 
system c, where c is a function from rankings of 3 vertices to [0, 1]. For brevity we 
henceforth abuse notation and and write c(Ranking((r)) by c(<r). The objective of 
a ranking CSP is to find an ordering a (w.l.o.g. a ranking) minimizing C(o~) = 
Y^g e foomain( a )\c(S H> a). We will only ever deal with one constraint system 

c at a time, so we leave the dependence of C on c implicit in our notations. 
Abusing notation we sometimes refer to S C V as a constraint, when we really 



Algorithm 2 Our algorithm for BetweennessTour. The runtime is 

n O(l) 2 0(^OPT/n)^ 
Input: Vertex set V 

1: Use the Algorithm from Theorem 1 101 to construct a set of rankings 77 

2: Let -K 9 °° d be the ranking from 77 with lowest cost 

3: for each tv 1 G 77 do 

4: if CV 1 ) < 2C(ji 3 ° od ) then 

5: Set r(v) = u\ \J C(7r 1 )/n + Uib(TT , v, 7r 1 (v))/n for all v £ V , where ai and a.2 

are absolute constants. 
6: Use dynamic programming (see Lemma|5} to find the optimal ranking 7r 2 with 

\tr 2 (v) — 7r 1 (")| < r(y) for all v. 
7: end if 
8: end for 

9: Return the best of the ir 2 rankings. 



are referring to c(S n- •). Clearly one can model BetweennessTour as a 
ranking 3-CSP. 

Let b(o,v,p) = ^2q.... c(Q H t a\v H >• p), where the sum is over sets Q ^ 
Domain (&)\{v} of size 2. Note that this definition is valid regardless of whether 
or not v is in Domain (a). The only requirement is that the range of a excluding 
<j(v) must not contain p. This ensures that the argument to c(-) is an ordering 
(injective) . 

Our algorithm and analysis for BetweennessTour are analogous to our 
results for FAST with two major differences. Firstly no kernel for betweenness 
tournament is known, which hurts the runtime somewhat. Secondly we use a 
more complicated approach to get the pr eliminary constant-factor app roxima- 
tion ranking tt 1 . We use the known PTAS IKarpinski and Schudvl 2009| with an 
appropriate error parameter to get a 2-approximation. Our analysis requires not 
only that tt 1 be of cost comparable to tt* but also that it be close in Kendall- 
Tau d istance. Fortunately the analysis of the PTAS from IKarpinski and Schudy 
2009] supports the following theorem. 



Theorem 10 (|Karpinski and Schudvl [2009]). There exists a polynomial- 
time algorithm for BetweennessTour that produces a set II of 0(1) rank- 
ings. With constant probability one of the rankings n £ 77 satisfies d(it, n*) — 
O(OPTfn) and has cost at most 2C(ir*), where n* is some optimal ranking. 



3.2 Analysis 

The following two lemmas are given in Karpinski and Schudy [2009( | in more 
generality. We give simplified proofs here for readability. 

Lemma 11 (jKarpinski and Schu dv [20Q9|). For any rankings tt and tt' over 
vertex set V , vertex v € V and pSRroe have 



\b(ir, v,p) - b(ir', v,p)\ < 3(ra - l)\/d(n,ir'). 



Proof. Fix 7r, 7r', v, and p. As in the proof of Lemma [51 consider the sets of 
vertices L — { u G V \ {v} : n(u) < p < tt'(u) } and R = { u 6 V \ {v} : 
tt' {u) < p < tt(u) }. From the definition of b we see that a constraint {u,u',v} 
contributes identically to b(TT,v,p) and b(Tr',v,p) unless either: 

1. {u,u'} and (L U R) have a non-empty intersection (or) 

2. 1 (tt(u) < tt(u')) ^ 1 (tt'(m) < tt'(u'))- 



In the proof of Lemma [5] we showed that \L\ = \R\ < J d(TT, 7r'). We can 
therefore bound 



|6(7r, «,p) - 6(tt', v, j>)| < (2Vd(7r^0)(n - 2) + d(7r, tt'). (2) 

Clearly d(7r,7r') < n(n-l)/2, hence d(7r, tt') = (^(tt, tt')) 2 < y/d(ir,Tr')y/^^- < 

(n — 2)^Jd(it,'K') for sufficiently large n. Substituting this inequality into the sec- 
ond term of ([2]) proves the Lemma. 

Lemma 12 (]Karpinski and Schudvl (2009j |). Let tt be a ranking ofV, \V\ 



n, v G V be a vertex and p,p' G M. Let B be the set of vertices (excluding v) 
between p and p' inn. Then b(w, v,p) + 6(71", v,p') > C" - ^)!- 5 ! . 

Proof. By definition 

b(ir,v,p) + b(n, v,p') = V" [c(Q i-> 7r|« ^ p) + C (Q i-» 7r]u ^ p')] (3) 
Q-— 

where the sum is over sets Q CV \ {v} of 2 vertices. Observe that betweenness 
tournament has a special property: the quantity in brackets in ([3]) is at least 1 
for every Q that has at least one vertex between p and p' in tt. There are at least 
\B\(n - 2)/2 such sets. 

Lemma 13. During the iteration of Algorithm^ that considers the ranking with 
d^ 1 ,^*) = 0(OPT/n) andC^ 1 ) < 2C(tt*) guaranteed by Theorem{W[we have 
\tt*(v) - ^(u)! < r(v) for all v G V. 

Proof. By Lemma [TT] and Theorem 1101 we have 



\b(TT*,v,j + 1/2) - b(7T\v,j + 1/2)| = 0(n^OPT/n) (4) 

for any j G Z. 

Fix v G V. We conclude 

n — 9 

k»-7T»| — 

< ^TrWOO + 1/2) + b(TT 1 ,v,ir*{v) + 1/2) (Lemma[l2| 



< 6(tt* , v, tt* (v) + 1/2) + O(VnOPT) + b^ir 1 , v, tt 1 (v) + 1/2) (By Q) 



< b{TT* , v, tt 1 (v) + 1/2) + 0(VnOPT) + b^ 1 , v, tt 1 (v) + 1 /2) (Optimality of tt* ) 



< 0(\/nOPT) + 26( 7 r 1 ,w,7r 1 (w) + 1/2) (By©) 
n — 2 

= r(v) (Definition of ?*(?;)). 



Lemma 14. In Algorithm^ we have maxjgz |{ v € V : |7r 1 (w) — j\ < r(v) }| = 
O(^CW)Jn)- 

Proof. We proceed analogously to the proof of LemmaH) Fix j. Let R = { v € V : 
|7r 1 (u) — j\ < r(v) }, whose cardinality we are trying to bound. We say v S V is 
pricey if &(tt\ v, tt 1 (w ))/n > v / 2C(ir 1 )/n. Clearly 3C(tt 1 ) = ^HirV.ir 1 !")) > 
(number pricey) ny^2C(ir 1 )/n hence the number of pricey vertices is at most 
3C( 7 r 1 )/( v /2nC'(7r 1 )) = O^C^/n). All non-pricey vertices in R have Itt 1 ^)- 
j| = 0(y / C(TT 1 )/n), so 0{'\jC('K l )/'n) non-pricey vertices are in R. We conclude 
\R\ = 0(VC(7ri)/n). 

Lemma 15. There is a dynamic program for betweenness that finds the opti- 
mal ranking tt 2 with \tt 2 (v) — tt (v)\ < r(v) for all v, with space and runtime 
0(|V r | 3 2^) where ip — maxj \{v £ V : |7r — j\ < r(u)}|. A divide and con- 
quer variant uses \V\°^ time and \V\°^ space. 

Proof. As in the proof of Lemma[9]we say that a set S C V is valid if it contains 
all vertices v with ir 1 ^) < IS"! — r(v) and no vertex v with 7r 1 (w) > IS"! + r(v). 
Observe that for any s € 1 the valid sets of size s are identical except for the 
presence or absence of at most i[> vertices. Therefore there are at most n2^ valid 
sets. 

We say that a ranking tt of valid set S is valid if { v : tt(v) < j } is a valid 
set for all < j < \S\. It is easy to see that a ranking tt is valid if and only if 
satisfies \tt(v) — 7T 1 (w) | < r(v) for all v. 

The dynamic program based on C(-) that worked for FAST docs not ap- 
pear to directly generalize to BetweennessTour. We consider an alternate 
approach. For any ranking tt over S let C'(tt) denote the portion of the cost 
shared by all orderings with prefix tt. That is, the cost of all constraints with at 
most 1 vertex outside 5*. One can easily see the following optimal substructure 
property: prefixes of an optimal (w.r.t. C") valid ranking are optimal (w.r.t. C) 
valid rankings themselves. 

For any valid set S let k(S) denote the C cost of the optimal (w.r.t. C) 
valid ranking of 5*. The recurrence relation is 

C'(S\{v})+ H c{u ^ l\v ^ 2\q ^ 3) . 

u<£S\{v} q£V\S 

Proof (of Theorem^. Lemmas [T51 and [T4l plus the test of the "if" in Algorithm 
[2J allow us to bound the runtime and space requirements of the dynamic program 
used by Algorithm [2] by 7 - l ( 1 )2 0( V c '( 71 ' s< "" i )/ n ) ; which is of the correct order since 
C(TT 9 ° od ) < 2C(tt*). The "for" loop is over a constant number of options and 
hence does not impact the runtime. 

For correctness we focus on the iteration of Algorithm [2] that considers the 
tt 1 £ II with d(7r\7r*) = 0(y/C(ir*)/n) and C(tt 1 ) < 2C(tt*) as guaranteed 
by Theorem [TO] Theorem [TO] ensures C^tt 1 ) < 2C(vr*) < 2C(7rf ood ) and hence 
the "if" is passed. By Lemma 1 131 7r* is among the orders the dynamic program 
considers. 



k(S) = min 

v£S:S\{v} is valid 
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